Adaptive deep learning for entity resolution by risk analysis

نویسندگان

چکیده

The state-of-the-art performance on entity resolution (ER) has been achieved by deep learning. However, models usually need to be trained large quantities of accurately labeled training data, and cannot easily tuned towards a target workload. in real scenarios, there may not sufficient data; even if they are abundant, their distribution is almost certainly different from data some extent. To alleviate such limitation, this paper proposes novel risk-based adaptive approach for ER that can tune model its workload the workload’s particular characteristics. Built recent advances risk analysis ER, proposed first trains then fine-tunes it unlabeled minimizing misprediction risk. Our theoretical shows correct label status mispredicted instance with fairly good chance. Finally, we empirically validate efficacy benchmark comparative study. extensive experiments show considerably improve models. Furthermore, scenario misalignment, similarly outperform alternatives transfer learning considerable margins. Using as test case, demonstrate promising potentially applicable various challenging classification tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DeepER - Deep Entity Resolution

Entity resolution (ER) is a key data integration problem. Despite the efforts in 70+ years in all aspects of ER, there is still a high demand for democratizing ER – humans are heavily involved in labeling data, performing feature engineering, tuning parameters, and defining blocking functions. With the recent advances in deep learning, in particular distributed representation of words (a.k.a. w...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Corefrence resolution with deep learning in the Persian Labnguage

Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...

متن کامل

Deep Active Learning for Named Entity Recognition

Deep neural networks have advanced the state of the art in named entity recognition. However, under typical training procedures, advantages over classical methods emerge only with large datasets. As a result, deep learning is employed only when large public datasets or a large budget for manually labeling data is available. In this work, we show that by combining deep learning with active learn...

متن کامل

Linking entity resolution and risk

A major emerging problem among consumer finance institutions is that customers that are not well recognized might be riskier than customers that are fully recognized. Fortunately, financial institutions count with external vendors databases that indicate the level of recognition of their customers. However, this information is normally presented as features with partial scores that must be aggr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge Based Systems

سال: 2023

ISSN: ['1872-7409', '0950-7051']

DOI: https://doi.org/10.1016/j.knosys.2022.110118